Journal: Journal of Proteome Research
Article Title: Transparent Exploration of Machine Learning for Biomarker Discovery from Proteomics and Omics Data
doi: 10.1021/acs.jproteome.2c00473
Figure Lengend Snippet: OmicLearn architecture. Left side: tabular experimental data files can be uploaded to OmicLearn as *.tsv, *.csv, or *.xlsx (Excel format). (1) Internally, OmicLearn uses the NumPy and pandas packages to import and handle data. OmicLearn is an interactive web-based tool built on the Streamlit package (2), which can be used to explore the data interactively. The application can be installed via a one-click installer or accessed online so that it is readily accessible for nonexperts. Right side: OmicLearn has access to the large machine learning libraries of scikit-learn and additional algorithms such as XGBoost. (3) The pipeline is set up to perform classification tasks on omics data sets with multiple cross-validations of results. Various performance metrics are displayed, leveraging the Plotly library. (4) The OmicLearn repository is hosted on GitHub and is open-source. Logos courtesy of the respective library/company (streamlit.io, scikit-learn, xgboost, plotly, github.com, pandas, and NumPy).
Article Snippet: Results are visualized with the graphic Python library Plotly ( https://plotly.com/python ) to generate high-quality interactive graphs, which can be exported as *.pdf, *.png, or *.svg.
Techniques: